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Abstract. It is shown that product BMO of S.-Y. A. Chang and R. Fefferman, defined on 
the space E rfl <g> • • • K dt , can be characterized by the multiparameter commutators of Riesz 
transforms. This extends a classical one-parameter result of R. Coifman, R. Rochberg, and 
G. Weiss @, and at the same time extends the work of M. Lacey and S. Ferguson [12] and 
M. Lacey and E. Terwilleger [19], on multiparameter commutators with Hubert transforms. 
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1. Introduction 



In one parameter, a classical result of Coifman, Rochberg and Weiss [8], in turn an extension 
^SJ ■ of the result of Nehari [26], shows that a function in the Hardy space H 1 on the ball can be 
weakly factored as a sum of products of functions in if 2 on the ball. Recently, Ferguson and 
Lacey [12] and Lacey and Terwilleger [19] proved the corresponding weak factorization for 
H 1 of the polydisc. In this paper, we prove the real variable generalization of these two sets 
of results. 

^" '■ def 

Let Mb ip = b-ipbe the operator of pointwise multiplication by a function b. For Schwartz 
functions / on M. d , let Rj / denote the jth Riesz transform of /, for 1 < j < d. From time to 
time, we will use the notation R for the identity operator. 

We are concerned with product spaces IR^* = IR dl <g>- • -(g)R rft for vectors d = (d\, . . . , d t ) G N*. 
For Schwartz functions b, f on IR d , and for a vector j = (ji, . . . ,jt) with 1 < j s < d s for 
s = 1, . . . , t we consider the family of commutators 

(1.1) Cjft/) = [■••[[M 6 ,R liil ],R 2j2 ],.-.],R t)it ]/ 



where R Sj ,- denotes the jth Riesz transform acting on 
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Main Theorem. We have the estimates below, valid for 1 < p < oo. 

(1.2) supHC^MIUp — ||°||BMO • 

3 

By BMO ; we mean Chang-Fefferman BMO. 

To establish this result, we find it necessary to prove an extended version of this Theorem, 
proving the equivalence of norms not only for the Riesz transforms, but also a class of 
singular integral operators whose symbols are supported on cones. In this, and other ways, 
our methods shed new light on issues related to commutators even in the one parameter case. 

It is well known that the result above has an equivalent formulation in terms of weak fac- 
torization of Hardy space; indeed, this equivalence is important to the proof of the Theorem. 
For J a vector with 1 < j s < d s , and s = 1, . . . , t, let ITj* be the bilinear operator defined by 
the following equation 

{Cfaf),g) = (b,Il^f,g)). 

One can express Ilj-as a linear combination of products of iterates of Riesz transforms, R SJ - S , 
applied to the / and g. It follows immediately by duality from the Main Theorem that for 
sequences f k ,g J k G L 2 (R d ) with Y,]Ylt=i\\fkh\Wkh < oo we have 

oo 

EEW^D^W 

3 k=l 

With this observation, we define 

oo 

(1.3) L 2 (RJ)QL 2 (W?) d ^ {/ G L\R d ) : / = ^E^'^} " 

j k=l 

This is the projective product given by 

ii/ii^(R<*)0£»cR ,r ) ^ f inf {EEn^ii 2 ii^ii 2 } 

3 k 

where the infimum is taken over all decompositions of / as in (11.31) . This definition has an 
obvious extension to L P QL P , for 1 < p,p' < oo. We have the following corollary. 

Weak Factorization Theorem. For any 1 < p < oo, let p' =p/(p — l) be the conjugate in- 
dex. We have ^(W 1 ) = L p (R d )©L p '(R d ). Namely, for any f G iJ 1 (lR a! ) there exist sequences 
f[e LP(M?) and g{ G such that 

oo 

3 k=l 
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\\f\\ H ^Y,Y,\\fl\\p\\&- 

J k 

The result of Coifman, Rochberg and Weiss |8] has found a number of further applications. 
The original paper includes a weak factorization result for certain Bergman spaces, and there 
is the striking application to the theory of compensated compactness [7J. We anticipate that 
some of these applications persist into the higher parameter setting of this paper, but we do 
not purse these points in this paper. 

The proofs given here are rather different from that of Coifman, Rochberg and Weiss 
[8]. Their proof of the upper bound on commutator norms relies upon a sharp function 
inequality, a method of proof that is quite powerful in the one parameter setting. This 
method admits only a weak extension to the higher parameter setting; instead, our proof of 
the upper bound, namely Theorem 15.31 follows from the decomposition of the commutators 
into a sum of simpler terms. These terms are paraproducts, composed on either side by 
Calderon-Zygmund operators. This method has been used in different settings, such as 
Petermichl [27], and Lacey [TSJ . Our formalization of this method in this paper could lead 
to further applications of this method. 

The paraproducts that arise are of multiparameter form. The specific result needed is 
Theorem !4.4l below. This result is due to Journe [13]; more recent discussions of paraproducts 
are in rj7J[23l[2S] • 

For the lower bound, namely Theorem 16.11 we use the strategy of Ferguson-Lacey [12] and 
Lacey-Terwilleger [15] . One inducts on parameters, using a boot-strapping argument, and 
the Journe Lemma [TJJ. However, to implement this strategy, we have to prove a second 
version of the Main Theorem, one in which the Riesz transforms are replaced by the a family 
of Calderon-Zygmund operators whose symbols are adapted to cones. These kernels are 
described in the text preceding (15.81) . and Theorem 15. 101 is the extended version of our Main 
Theorem. 

To start the induction, in the case of Riesz transforms, we can of course use the Coifman, 
Rochberg, and Weiss result. But for the cones, we appeal to the results of Uchiyama [29J and 
Song-Ying Li [21] which are deep extensions of the work of Coifman, Rochberg and Weiss. 

In § [2]— [H we recall different aspects of the multiparameter theory in forms required for 
this investigation. £J5] introduces the cone operators, and this establishes the upper bound on 
commutator norms. The initial stages of the lower bound on commutator norms is proved in 
§6j The more refined bootstrapping argument occupies $7J 

Acknowledgment. The authors benefited from the Banff Research Station, through the Re- 
search in Teams program, and a very pleasant stay at the Universite de Bordeaux. The 
authors also enjoyed a stay at Texas A&M University. 
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2. Wavelets in Several Dimensions 

This discussion is initially restricted to a one parameter setting. We will use dilation and 
translation operators on M. d 

(2.1) Tr y f(x) d ^ f(x-y), y G R d , 

(2.2) Dil<f } f(x) = a- d / p f{x/a) , a>0,0<p<oo. 
These will also be applied to sets, in an obvious fashion, in the case of p = oo. 

By the (d dimensional) dyadic grid in W 1 we mean the collection of cubes 

V d = {j2 k +[0,2 k ) d : jez d , kez}. 

Wavelets arise from a mean zero Schwartz function w, a scaling function, expressible as 
W(x) — 2W(2x), for a father wavelet W. The principle requirement is that the functions 
{Tr c m Dilj w : / G T>\\ form an orthonormal basis for L 2 (IR). Except for the fact that it 
is not smooth, h = —1 (0,1/2) + l(i/2,i) is a scaling function, with father wavelet l(o,i)- This 
generates the Haar basis for L 2 (IR). 

For e G {0, 1}, set w° = w and w 1 = W, the superscript denoting that 'the function 
has mean 0,' while a superscript 1 denotes that 'the function is an L 2 normalized indicator 
function.' In one dimension, for an interval /, set 

wj = Tr c(J) DilfJ w £ . 

The father wavelet is of some convenience to us, as we have the useful facts, valid on the 
interval J0 

(2.3) Y,(f,w I )w I ={f,w 1 J )w 1 J , 

IDJ 

We will use the Meyer wavelet in later sections of the paper. This wavelet, found by 
Y. Meyer [2"2"ll2"3"j . arises from a Schwartz scaling function w, with w supported on 1/3 < |^| < 
8/3. Indeed, w is identically equal to 1 on the intervals 1 < |£| < 2. The 'father wavelet' W 
is a Schwartz function with W supported on |£| < 2, so that w{x) = W(x) — 2W(2x). One 
of the reasons this is such a useful wavelet for us is the fact below which is exploited several 
times. 

(2.4) 8|/| < |/'| implies wj^wjf is supported on (4|/|) _1 < |£| < 3|/| _1 . 



Technically, these results are only true for multiresolution analysis (MR A) wavelets. Both the Haar and 
Meyer wavelets are MRA wavelets. 
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Let Sig d == {0, l} d — {1}, which we refer to as signatures. In d dimensions, for a cube Q 
with side |/|, i.e., Q = I\ x • • • x I d , and a choice of e G Sig d , set 

d 

w £ Q (x 1 ,...,x d ) = Ylw'/.^Xj). 

3=1 

It is then the case that the collection of functions 

Wavelet^ = {w £ Q : Q G V d , e G Sig d } 

form a wavelet basis for L p {R d ) for any choice of d dimensional dyadic grid V d . Here, we are 
using the notation 1 = (1, . . . , 1). While we exclude the superscript 1 here, it plays a role in 
the theory of paraproducts. 

We will use these bases in the tensor product setting. Thus, for a vector d = (dx, . . . ,d t ), 
and 1 < s < t, let V dg be a choice of d s dimensional dyadic grid, and let 

Also, let Sigj- = f {e = (ei, . . . , e t ) : e s G Sig ds }. Note that each e s is a vector, and so e is a 
'vector of vectors'. For a rectangle R — Q\ x • • • x Q t , being a product of cubes of possibly 
different dimensions, and a choice of vectors e G Sig^ set 

t 

wi(x u ...,x t ) = Y[w £ ^ a (x s ). 

s=l 

These are the appropriate functions and bases to analyze multiparameter paraproducts and 
commutators. 

Let 

Wavelet^-. = f {w s R : ReVj, e G Sig d -} . 
This is a basis in L p (IR d ), where we will use the notation 

R d ~ d ^ f ]R d i ® . . . o E d * 
to emphasize that we are in a tensor product setting. 

3. Chang-Fefferman BMO 

We describe the elements of product Hardy space theory, as developed by S.-Y. Chang 
and R. Fefferman [5|[6|[9HTT] as well as Journe [13j[Tl]. By this, we mean the Hardy spaces 
associated with domains like ®^ =1 IR ds . 
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Remark. The (real) Hardy space H l (WL d ) typically denotes the class of functions with the 
norm 

d 

£iir*/Hi 

3=0 

where Rj denotes the jth Riesz transform. Here and below we adopt the convention that 
Ro, the Oth Riesz transform, is the identity. This space is invariant under the one parameter 
family of isotropic dilations, while H l (M> d ) is invariant under dilations of each coordinate sep- 
arately. That is, it is invariant under a t parameter family of dilations, hence the terminology 
'multiparameter' theory. 



As before, the space H 1 (M> d ) has a variety of equivalent norms, in terms of square functions, 
maximal functions and Riesz transforms. For our discussion of paraproducts, it is appropriate 
to make some definitions of translation and dilation operators which extend the definitions 
in (12.1 1) — (12.21) . (Indeed, here we are adopting broader notation than we really need, in 
anticipation of a discussion of multiparameter paraproducts.) Define 



DiP 



oi, ...,at 



TT y f(x) = f(x-y), yeR d , 
t 

f(x 1 ,...,x t ) = Y[a~ ds/p f(x 1 /a 1 , . . . ,x t /a t ) , a 1 ,...,a t > 0, 



s=l 

Bi\ p R d ^ Tr c(jR) r>ilf Ql |,.„,| Qd | • 



(3.1) 
(3.2) 

(3.3) 

In the last definition R — Qi x • • • x Q t is a rectangle, each Q s is a cube and the dilation 
incorporates the locations and scales associated with R. c(R) is the center of R. 

For a non- negative smooth bump function (f 1 in IR d with J ip l dx = 1, define the (strong) 
maximal function by 

Mf(x) ^ su P Dil^ 1 (x)(/,Dil^ 1 ). 
For s = 1, . . . , t, choose radial bump functions (p° s on R da with f Rds (p® dx s = and 



Then, fix (f° so that 



sup 

5 



y?°(xi, 



9 s (t0\ 2 j <oo 



■ Xt) 



def 



As an analog of the Littlewood-Paley square function, set 

d = f [^[Dil^°(x)] 2 |(/,Dil^ )| 2 



S/(x) 



1/2 
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Theorem 3.4 (Equivalent forms of H 1 norm). All of the norms below are equivalent, and 
can be used as a definition of H 1 {R d ). 

t 

iiM/iix-ii/iii + ns/ii!- EllIK^IL- 

0<j<d s=1 

K s ,j s is the Riesz transform computed in the j s th direction of the sth variable, and the Oth 
Riesz transform is the identity operator. 



3.1. BMO(R^). The dual of the real Hardy space is H 1 ^)* = BMO(M (? ), the t-fold product 
BMO space. It is a Theorem of S.-Y. Chang and R. Fefferman [6] that this space has a 
characterization in terms of a product Carleson measure. 



Define 

def 



(3.5) ii&Hbmo(*> = supjit/r 1 e E K 6 »«4)i 7lJ 



-RcC/eeSig,- 



Here the supremum is taken over all open subsets t/ct with finite measure, and we use a 
wavelet basis w e R . 

Theorem 3.6 (Chang-Fefferman BMO). We have the equivalence of norms 



I (H 1 (M. d ))* WJ llBMO(M d )- 

That is, BMO(R^) is the dual to H X (NL*). 



3.2. Journe's Lemma. The explicit definition of BMO in (I3.5P is quite difficult to work 
with. In the first place, it is not an intrinsic definition, in that one needs some notion of 
wavelet to define it. Secondly, the supremum is over a very broad class of objects: All open 
subsets of lR d of finite measure. There are simpler definitions (also unfortunately not intrinsic) 
that in particular circumstances are sufficient. 

Say that a collection of rectangles U C has t — 1 parameters if and only if there is a 
choice of coordinate s so that for all R,R'eU we have Q s = Q' s , that is the sth coordinate 
of the rectangles are all one fixed d s dimensional cube. 

We then define 

(3-7) II/IIbmo_ i( *) = SU P [|sh(W)l _1 E EKJ>*>l ?lJ 2 

U has t — 1 L „ „ - 

parameters rtfcM efcsig^ 
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A collection of rectangles has a shadow given by sh(U) = \J{R : ReU}. We use the ' — 1' 
subscript to indicate that we have 'lost one parameter' in the definition^ Motivation for this 
definition comes from our use of induction on parameters in the proof of the lower bound for 
the commutators. See ^6.11 



L. Carleson [4] produced examples of functions which acted as linear functionals on H 1 (IR d ) 
with norm one, yet had arbitrarily small BMO_i norm. This example is recounted at the 
beginning of R. Fefferman's article |lUj . 

Journe's Lemma permits us, with certain restrictions, to dominate the BMO norm by the 
BMO _i norm. We need a version of this statement with an additional refinement, see ( 13.1 Op 
that first appeared in [T2], and is important to our 'bootstrapping' argument in §0 

Lemma 3.8 (Journe's Lemma). LetU be a collection of rectangles whose shadow has finite 
measure. For any r\ > 0, we can construct V D sh(U) and a function Emb : U — ► [1, oo) 
so that 

(3.9) Emb(i?) • R C V, ReU, 

(3.10) \V\ < (l + 7j)|sh(W)|, 

(3.11) \Y. E Emb( J R)- c (/,^) W 

R£U eeSig d - 



< K ,, , 

— J ^VUJ llBMO- 



The constant K v depending only on r] and d, and the constant C , appearing in the last display, 
upon the vector d. 



Notice that the power on the embeddedness term in (13. lip is allowed to be quite big, a 
function of the parameters d that we do not specify. Also, concerning the conclusions, if 
we were to take Emb(i2) = 1, then certainly the first conclusion (I3.9P would be true. But, 
the last conclusion would be false for the Carleson examples in particular. This choice is 
obviously not permitted in general. 

The formulations of Journe's Lemma given here are not the typical ones found in Journe's 
original Lemma, or J. Pipher's extension to the three dimensional case [28]. These papers give 
the more geometric formulation of these Lemmas, and J. Pipher's article implicitly contains 
the geometric formulation needed to prove the Lemma above (provided one is satisfied with 
the estimate |V| < |sh(W)|). See Pipher [2B]- Lemma I3T8"1 as formulated above, was found in 
Lacey and Terwilleger [15] ; the two dimensional variant (which is much easier) appeared in 
Lacey and Ferguson [12]. The paper of Cabrelli, Lacey, Molter and Pipher [3] surveys some 
issues related to Journe's Lemma. See in particular Sections 2 and 4. We refer the reader to 
these references for more information on this subject. 

2 In the two parameter case, our definition of BMO_i is actually a slightly larger space than the more 
familiar rectangular BMO space. 
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4. Paraproducts 

The paraproducts that arise are of a somewhat general nature, and so we make some 
definitions which will permit a reasonably general definition of a paraproduct. 

Let x( x ) = (1 + \ x \ 2 ) _1 - Let Xq — Dilg x- Say that ip is adapted to Q iff 

(4.1) \V m V (x)\<\Q\- m [x%\x)}\ xeR d . 

This inequality should hold for all derivatives D m , where m < d + 1, where ti is the ambient 
dimension. The inequality should hold for all integers N. The implied constant can depend 
upon these parameters. Say that ip has a zero iff J (p dx — 0. 

We extend these definitions to functions <p on M. d . Say that ip is adapted to R = \\Q S \i 
and only if 

t 

(4.2) v{ x ii ■ ■ ■ i x t) — W^s^s) , where (p s is adapted to Q s . 

s=l 

Say that ip has zeros in the sth coordinate if and only if 

(4.3) / ip(xi,...,x s ,...x t ) dx s = 0, for all xi, . . .x s _i,x s+ i . . 

The main Theorem on paraproducts that we will need concerns bilinear operators formed 
in this way. For j = 1,2,3 let {y2j,,R : R G Vj} be three families of functions adapted to the 
dyadic rectangles in T>g. Then define 

B(/i,/ 2 ) = IDI1/2 U2» ¥>3,.R- 

fleuj- 1 1 

The following result is due to Journe (USEE]- -^lso see [T7 1 I24 1 I25]. 

Theorem 4.4. Assume that the family {(fi^} has zeros in all coordinates. For every other 
coordinate s, assume that there is a choice of j = 2,3 for which the the family {(pj,ii} has 
zeros in the sth coordinate. Then the operator B enjoys the property 

B : BMO x L p — ► LP , 1 < p < oo . 

We will refer to the function (pi as the symbol of the paraproduct. This function plays the 
same role for paraproducts as does the symbol of the commutator. Particularly relevant for 
us is the following reformulation of this theorem: If Bx and B 2 are bounded paraproducts, 
then the tensor product Bi ® B 2 is a bounded paraproduct for symbols on the corresponding 
product BMO space. 
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In many applications of this result, the functions <pi t R, acting on the symbol of the para- 
product, will be product wavelets. 



A more particular form of the upper bound on commutators plays a role in both the upper 
and lower bounds for our Main Theorem. We state this variant of Theorem 14.41 for our use 
below. In particular, the estimate ( 14.111) is used in the lower bound. It holds when the 
symbol and the function the paraproduct is applied to have 'separated wavelet support' in 
the sense of ffl~TUj) . 

For a subset of coordinates J C {1, . . . , t} set 

(4-5) F^ f £ AF fc> 

"?eSig d - kei 1 

k s =i s , seJ 

def \ -* \ ^ g k> „..£ 



(4.6) AF fc ^ f £ £ 



W r, W , 



R w W R • 

eesig d - Rev s 

\Q s \=2 h » 

For those coordinates s G J, we take the wavelet projection onto that scale, while for those 
coordinates s G" J, we sum larger scales. That means that we lose the zero in the coordinates 
not in J. 

Write R' <j R if and only if \Q' S \ < \Q S \ for s £ J and \Q' S \ = \Q S \ for s G J. 

Theorem 4.7. Let T be a product Calderon-Zygmund kernel as in Theorem \ 5.2 . For all 
J C {1, . . . , t}, and k G Z* with 

(4.8) 3<k 8 <8,s£J, -8<k s <8,seJ. 

We have 

(4-9) ||^(AF f 6)-TF f+fcV ^|| 2 < ||6|| BMO(Ktr) ||^|| 2 . 

fez* 

Moreover, suppose we have the following separation condition: Fix an integer A > 0. Suppose 
that 

(4.10) if for e and e', (b, iwf,) ^ 0, (<p, w%) ^ with R' <j R, then ARDR' = 0. 
We then have the estimate 



< A- 100t \ 



(4.11) ||E( AF f & )- TF f + M^ 

fez* 

— * 

Implied constants are independent of the choice of k. 



MULTIPARAMETER RIESZ COMMUTATORS 



11 



The operator in (|4.9p . though it fits into the category of paraproducts, it does not fit the 
precise definition we have given of a paraproduct above, and so we will postpone the proof 
of this Theorem until the end of this section. 

Shift Operators. There are different types of 'shifts' on wavelets that also enter into our 
considerations. These are shifts of signature, scale and location. We discuss each of these. 

Define a 'signature shift' operator by a map e : Sigj*x Vj — ► Sigj-, which e(-, R) is one to 
one for each rectangle R. Then the operator is defined first on wavelets by 

HI S i g>e (<4) = wf R) 

and then extended linearly. The boundedness properties of these operators are straightfor- 
ward. 

Proposition 4.12. We have the estimate 

1 1 LUsi g ,£ 1 1 p^p < C p , l<p<oo. 

The proof follows immediately from the Littlewood-Paley inequalities. We omit the details. 

Define a 'scale shift' operator by a one to one map a p : T>j — > 2)^ that sends each dyadic 
rectangle R into a unique u(R) C R, so that the ratio p = \a(R)\/\R\ is independent of R. 
The parameter of this operator is p. Define a corresponding linear operator ni sca i eiP by 

IH S cale,p(wfj) = f y/p ■ W £ a(R) 

and the operator is then uniquely defined by linearity. Our observation is that this shift is a 
uniformly bounded operator on product BMO 

Theorem 4.13. The operators HI S caie,p 'map BMO(lR d ) to itself. Moreover for all k > we 

have the estimate 

||LU sca ie,p||BMO^BMO ^ P * ■ 

Proof. Given / e BMO, and open set U C M. d , consider the set 

V = {M l v > cp} 

where M is the strong t parameter maximal function appropriate to this setting, namely 

Mf = sup / \f(y)\dy. 
Rev s \ J 2 r 

Observe that for appropriate c if cr(R) C U then we have R C V. 
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We can estimate 

E EK m -ie,,/,<>i 2 = p E Ek/'<-WI 2 

<p E Ek/'<)i 2 

£GSig d -i?cy 

<pIHIbmoM- 

It remains to estimate |V| in terms of \U\. 

Using the LP mapping properties of the maximal function, we can estimate 

\V\ <P~ P \U\. 

Taking p = 1 + k proves our theorem. □ 

Remark. When the number of parameters t — 1, the operators HI sca i CiP are in fact uniformly 
bounded on BMO as follows from the weak L l bound for the maximal function. For t > 1, 
there is a logarithmic estimate. 

IIbmo-^bmo ^ (1 + l°g 1/ pf- 
The strong maximal function we are using satisfies ||M||p^ p < (p — aside from dimen- 

— * 

sional considerations from the individual components of d. Using this estimate, and taking 
p — 1 ~ |logp| _1 , the estimate above follows. 



We define 'location shift' operators. Let A n : T>j — ► I^be a one to one map such that for 
all rectangles R G Uj, the image rectangle A„(i?) has the same dimensions in each coordinate, 
namely 

\Q S \ = \K(Q)s\ , i<s<t. 

Moreover, \(R) C nR. The shift operator is then defined on wavelets by 

(4-i4) m loc , n w e R = wi n{R) , 

and is then extended linearly. The parameter of this operator is said to be n. 

The estimate we need concerns the LP norms of this operator. 
Proposition 4.15. We have the estimates below, valid for all integers n. 

IIIII, II < T7 |d "i 

— * — * 

where \d\ — d± + d 2 + • — h d t depends only on d. 



Proof. Since A„ is one to one, it is clear that UIi oc is bounded with norm one on L? . For 
p 7^ 2 we use the Littlewood-Paley inequalities, together with the obvious fact that 
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for all rectangles R. Then, using the Fefferman-Stein Maximal inequality, we have 

.,A|2 ,1/2 



13 



I in 



loc 



(R) 



< 



n< 



< n\ d 



E E 
E E 



K/>S> I 2 



iMl R f 



p 

nV2 



10^4) I 2 

I-RI 



1/2 



□ 



Generalized Paraproducts. Experience shows that paraproducts arise in a variety of ways. 
They do in this paper, and in this section, we adopt a notation to formalize the different 
ways that the paraproducts arise. 

Given an operator P acting on L p (IR d ), we set 
(4-16) ||P|| Para = inf{^^^JV 1/|dl |c(e,p,n)|} 

e p n 

where the infimum is taken over all representations 

P / = ^2 S C ( 6 ' P' ™) ' B f,P,«( m scale,p b, LHsig,e HIloc,n /) • 

e p n 



In this display, the operators B e p n are paraproducts as in Theorem 14.41 with norm at most 
one. The operators HI sca i eiP are scale shift operators, with parameter p; the ILIsig,e are 
signature shift operators; and IIIi OC)n are location shift operators of parameter n. 

We may combine the different results of this section into the estimate 



(4.17) 



ip->p ~ 



< IIP! 



Para 



1 < p < OO 



Examples of how to use this norm are in the next proof. 



Proof of Theorem \4- 7[ We will assume that the Calderon-Zygmund operator T is the identity. 
It is straightforward to supply the necessary additional details to accommodate the general 
case. 
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The 'father wavelet' W permits us to rewrite the operator in (14. 5p . For a subset of coor- 
dinates J C {1, . . . , t} we set 

Wr,j{x x , ...,x t ) = Y[w £ ^ g (x s ) •Y[W Q .(x s ) . 

se.J s^j 

Thus, in the coordinates in J we take the Meyer wavelet, and for those coordinates not in J 
we take a father wavelet. In particular, Fj-j as defined in (14. 5 ft is 



f ij = E E w *v ® w *v ■ 



?6Si gd- Rev* 

\Rs\=2 l ° 



Let k e Z* be as in (14.81) . Let i?, i?' be dyadic rectangles with 
(4.18) \Q S \ = 2 k °\Q' s \ , l<s<t, A~Ml R (e(R')). 



The function 



Cr,r'j = A~ N ^\R\w R -W R/ ,j 



— * 

is adapted to R, in the sense of (14.21) . Here N is a fixed large constant depending upon d. 

The assumption (14. 8p plays an essential role in describing the zeros of the function Cr,R',J- 
W R i } j has zeros for s 6 J, but certainly does not have zeros for s (jL J . The properties of 
the Meyer wavelet, and in particular (12.41) . along with the assumption on k then imply that 
(r,r',j has zeros for s ^ J. 

Now, consider a map tta '■ T>j — > T>j such that the pairs R,tt(R) satisfy (I4.18p . Set 
/ti(7r) = A where A is as in (I4.18p . The operator 

B n (b,tp) = b, ^r (<P> W *(R),j) Cr,tt(R),J 

\R\ 



is a paraproduct, composed with a change of location operator. Note that the function that 
falls on b has zeros in all coordinates; the function that falls on ip has zeros for s G J, and 
(r,it(R) h as zeros for s ^ J. It is then clear that 

(4.19) \\B^\\ Palat <fM(7T) N . 

Now, a moments thought reveals that we can write, for appropriate choices of tt v , 

oo 

(4-20) £(A F f b) ■ F f+fcV = E B - ^ <P) ■ 

Fez* v=1 
Moreover, for all < A < 1, the number of tc v occurring in the sum above with //(7r„) ~ A is 
at most A~ c where C depends upon d. But then from (14.191) . it is clear that (14.91) holds. 
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The second conclusion of the Lemma, (14.111) . is quite important to the proof of our lower 
bounds on commutator normsB But with the assumption (14.101) . note that we can again 
have the equality ( I4.20p . but with this additional property: For all v, we have fi(n v ) < A. It 
is then clear that ( 14.111) holds. 

□ 



5. The Upper Bound 



Let K be a standard Calderon-Zygmund convolution kernel on M. d x IR d . This means that 
the kernel is a distribution that satisfies the estimates below for x ^ y 

\V j K(y)\<N\y\- d -i, j = 0,1,2, ... ,d + 1 . 

(5.1) 



The first estimate combines the standard size and smoothness estimate Jj The last, and 
critical, assumption is equivalent to assuming that the operator defined on Schwartz functions 
by 

Tjr/(s) = J K(x-y)f(y)dy 

extends to a bounded operator on L 2 (M. d ). The least constant N satisfying the inequalities 
( 15. ip and 1 1 1 1 2 »2 < iV is some times referred to as the Calderon-Zygmund norm of K . 

Now let Ki,...,K t be a collection of Calderon-Zygmund kernels, with K s defined on 
M. da x M ds . It is not obvious that the corresponding tensor product operator 

T Kl <g> • • • ® T Kt 

is a bounded operator on L p (M> d ). This is a consequence of the multiparameter Calderon- 
Zygmund theory. This is a basic fact for us, so we state it here. 

Product Calderon-Zygmund Kernels 5.2. Let Ki, . . . , K t be a collection of Calderon- 
Zygmund convolution kernels, with K s defined on M, da x M. ds . Then 

T Kl g) • • • <g) T Kt 

extends to a bounded linear operator from L p (IR d ) to itself for all 1 < p < 00. 



It is also not at all clear that the multiparameter commutators are bounded operators, 
even in the case of the Riesz transforms. Thus, this is one of the principal results of this 
paper. 

3 Estimates of this type are also important to detailed information about norm bounds for paraproducts. 
See OH §4.3]. 

4 Our proof requires a large number of derivatives on the kernel, due to an argument in H5.ll 
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Theorem 5.3. We have the estimates below, valid for 1 < p < oo. 
(5-4) llfrjc 1> ---fr Ji r i ,M 6 ]...]||^ 1 ,<||6||BMO. 

By BMO ; we mean Chang-Fefferman BMO. The implied constant depends upon the vector 
d, and the Calderon-Zygmund norm of the Tk s ■ 

There are two corollaries of this that we will use below. For a symbol b on IR d , define 

(5.5) IHlRicsz.p d = sup||[R ils • • ■ [%,M 6 ] • ■ -jWp^p, l<p<oo. 

where the supremum is formed over all choices of Riesz transforms R Ja for 1 < j s < d s ^ 
The Riesz transforms of course fall under the purview of the Theorem above, so we see that 
II &|| Ricsz, P < || b\\ bmo- This is half of our Main Theorem, and the other half is the reverse 
inequality. 

We will have need of another class of singular integral operators besides the Riesz trans- 
forms, with the Fourier transform of these kernels-the symbol of the kernel-being well 
adapted to a cone in W 1 . 

Suppose that the dimension d > 2. A cone C C W 1 is specified by the data (£cs Q) where 
£c G M. d is a unit vector referred to as the direction of the cone and Q C IR ' -1 is a cube 
centered at the origin. The cone consists of all vectors 9 given in orthogonal coordinates 
(9^, &±) with 0£ = 9 ■ £, and 9± G 9%Q. For < A by AC we mean the cone with data 
(£cjA<5). By the aperture of C we mean \Q\. 

The Fourier restriction operator specified by C should be bounded on all LP spaces, Namely 
the operator defined by 

(5.6) V~ c f = l c f 

should admit a uniform bound on all L p (M d ) spaces. By taking the boundary of the cone 
to be a cube this is certainly the case: Compositions of Fourier projections onto half spaces 
yields Pc, so it will not be given by composition with respect to a (one parameter) Calderon- 
Zygmund kernel as in (15. ip . 

For a cone in C C M d , we fix a Calderon-Zygmund kernel Kg which satisfies the size and 
smoothness assumptions above, and in addition, 

(5.7) lc<K c <l { i +K)c . 

Here, we introduce a small parameter k which will depend upon dimension d. Moreover, 
we choose the cone operator to make a sufficiently smooth transition from to 1 that the 
operator Tc with symbol given by Kq defines a Calderon-Zygmund operator, bounded on 
all L p , 1 < p < oo. 



'In the case that d s = 1, the Riesz transforms reduce to the Hilbert transform. 
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There is however an essential point to observe: That the kernel Kq satisfies the Calderon- 
Zygmund estimates fl5.ll) . but with constants that tend to infinity as the aperture of the 
cone tends to infinity. In the limit, the kernels Kq tend to a projection of a one dimensional 
Calderon-Zygmund kernel @ 

But, with the aperture fixed, in each dimension, we can choose these kernels to be rotations 
of one another, so that they admit uniform bounds in L p (M. d ). We will refer to the operator 
Tc given by convolution with Kc as a Cone transform. 

As a matter of convention, in the case of d = 1, there are two cones, M±. The Cone 
transforms are the corresponding projections onto the positive and negative frequency axes. 
These are of course linear combinations of the identity and the Hilbert transform, which 
coincide with the Riesz transforms. 

We now define a third norm on a symbol b on WL d 

(5.8) ||&||cone, P = sup||[T Cl , • ■ ■ [T Ct , M 6 ] ■ • -Wlp^p, l<p<oo. 

where the supremum is formed over all choices of Cone transforms Tc s with C s C M. de in 
which the aperture of the cone is fixedQ It follows that we also have ||6||cone,p ^ ||&||bmo- 
This is an important observation for us, so let us formalize it in the following Corollary of 
Theorem I5.3[ which includes half of our Main Theorem. 

Corollary 5.9. We have the inequalities 

||6||Riesz,p , ||fr||conc,p < ||&||bMO , 1 < £>< OO . 

For the inequality concerning Cone operators, the implied constant depends upon the aperture 
of the cones. 

Remark. In the one dimensional case, a 'cone' is just a projection onto the positive axis say, 
and most of the considerations of this section are not needed. For the sake of exposition, 
in this section we will assume that all the coordinates of d — (d\, . . . , d t ) are at least two. 
The case when some coordinates are one is technically easier, but more difficult in terms of 
accommodating the general argument into the notation. 

Let us formalize the extended version of our Main Theorem. 
Extension of Main Theorem 5.10. For allt >1 and choices of d we have 

||6||Riesz,p - ||&||cone,p - ||6||bMO , 1 < p < OO . 

The implied constants depend upon the vector d and the aperture of the cone. 

6 The operators admit uniform LP bound in the aperture, but we need to apply a Theorem of Song-Ying 
Li [5T] which only applies if the kernels are Calderon-Zygmund on M. d . 
^Later in the proof, we will specify an aperture. 
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We find it necessary to prove the equivalence between the BMO and Cone norms in order 
to deduce the equivalence with the Riesz norm. 

5.1. A One Parameter Result. A commutator is a special form of a paraproduct. Our 
approach to Theorem 15.31 is obtain a decomposition of a one parameter commutator into a 
sum of paraproducts. The tensor product of the elements of our decomposition are themselves 
bounded operators, so we can then pass to the multiparameter statement of the Theorem. 

Remark. The multiparameter setting is related to the tensor products of dilation groups. An 
essential difficulty is that the tensor product of bounded operators need not be bounded. See 
|16j . And so it will be incumbent upon us to describe sufficient conditions on the operators 
for the tensor products to be bounded, and reduce the commutators above to these settings. 

A result of this type, expressing a commutator as a sum of paraproducts, is known to 
experts, and has been used in pQ, and may well have been formulated in this way before. 

Proposition 5.11. For any Calderon-Zygmund kernel satisfying 115. and symbol b we can 
write the commutator [Tk, M&] as an absolutely convergent sum of paraproducts composed 
with signature, scale and location shifts. That is, using the notation in ^.16j) , 



Proof of Theorem \5.3l The Proposition above shows that a commutator is the absolutely 
convergent sum of bounded paraproducts. The result of Journe, Theorem I4.4[ is that the 
tensor product of bounded paraproducts is bounded. As the commutators in our Theorem 
act on a tensor product space, we see that the commutators can be written as an absolutely 
convergent sum of tensor products of bounded paraproducts. Hence, the Theorem follows. □ 

Proof. A basic fact here is that if is adapted to a cube Q, then so is Tk 4>- Clearly, Tk 4> 
has a zero. This in particular shows that for a paraproduct operator B, we have 



As we are working with convolution operators, we could use a classical Littlewood-Paley 
decomposition method to prove this result. We have however already introduced wavelets 
(which are essential later in this paper) so we prefer that method here. 

We recall that the Meyer wavelet w in one dimension has Fourier transform identically 
equal to zero on a neighborhood of the origin. It follows from the rapid decrease of the 
wavelet that we then have 



||[Tx,M 6 ]|| Para <l 



\\Tk ° B j | p ara + ||B O T^||p ara < ||B|| Para 



(5.12) 
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That is, the wavelet is orthogonal to all polynomials in x. This property extends to the 
multidimensional Meyer wavelet. 

Set 

(5.i3) f, d ^ f E w q® w q 

£eSig d \Q\>2i d 

be the Father wavelet projection. And set 

AF, d ^ f F j -F j+1 = E w h® w h 

£GSig d \Q\=2i d 

be the projection onto the wavelets of scale Q? . 

The property f!2.4[) is relevant to us. In particular, it follows from this that we have the 
Fourier transform of the product Aj Fj b ■ F J+3 / is localized to 2~ J ~ 3 < |£| < 2~ J+3 . 

We expand the commutator in these wavelet projections. Thus, 

[T K , M b ]f = J2i T K, M AFj b ] AF f f. 
3,3' 

The principal term arises from j + 3 < j', where we do not have any cancellation in the 
commutator, and we write 

E t T ^ m af, b ] AP> f = T K o B x (b, f) - B 2 (6, /) , 

j+3<j' 

Bi(fe, 4>) = ^AFjft-F^^, 

j 

B 2 (b,<f>) = ^AF^-T^oF^, 

i 

It is important that the product AFj b ■ Fj +3 f have no Fourier support in a neighborhood of 
the origin that has diameter proportional to 2~- J . Certainly, Bi is a paraproduct. It follows 
that T^oBi is as well. Upon inspection, one sees that B 2 is a paraproduct. It is also 
straightforward to verify that \\Tk o Bi||p ara + ||B 2 ||p a ra < 1- 

In the remaining cases we expect terms which are substantially smaller. The principal 
point is this estimate. For e, e' G Sig d , 
(5.14) 

|[t x ,m^]<(x)| < [^] 1+1/2d (i + ^^)- JV |Q|- 1/2 [xg ) W] 7V , 2 3d \Q\ > \Q'\- 

(2) _____ 

Here, Xq' is as in (14.11) . In the language of the section on paraproducts, this shows that a 
large constant times this function is adapted to the cube Q' . The power —N on the term 
involving distance holds for all large N; a power of N > d is required; The power 1 + l/2<i 
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on the ratio follows from the number of derivatives we have on the kernel in (15.11) ; 

some power larger than one is required. 

With the inequality (I5.14p . it is easy to verify that 

|| [M 6 , TV] ■ - T K B X (b, •) + B 2 (6, •) llPara < 1 • 



The proof of ( 15.141) is taken in two steps. We have 



|[Tk,M^]<4(x)| < (l + t-^y^Ql-^ixr , 2™\Q\ > \Q'\. 

That is, we do not have the term involving |Q|/|Q'| appearing on the right. This estimate is 
easy to obtain, and we omit the details. 



The second estimate is 
(5.15) 



[T^M w e]^,(x)|<[^] 1+1/d |g|-^[ x g)(x)] 



N 



2 3d \Q\ > \Q'\ 



That is, we have a slightly larger power of |Q|/|Q'| than is claimed in (I5.14p . Taking a 
geometric mean of these two estimates will prove (I5.14p . 

To see (15.151) . let us assume that \Q\ = 1, which we can do as a dilation of K has the same 
Calderon-Zygmund norm as K. Observe that the commutator above is 

/K(*)-«6fe)}*(* -,)■<(»)*. 

Write the leading term in the integral as 

{ w q0) - w Q(y)} K ( x -y)= T ( x -y) + E (x, y) , 

where T(x — y) is the <ith degree Taylor polynomial of the left hand side, and the error 
term E(x,y) satisfies \E(x,y)\ < \x — y\ d+1 . That we have this estimate follows from our 
assumptions (15. ip on the kernel K. The wavelet Wq,, by choice of wavelet, is orthogonal to 
the Taylor polynomial, see (15.121) . Thus, as claimed, 



{w £ Q (x) - w e Q (y)}K(x - y) ■ w £ Q ,(y) dy 



< 



< 



E(x,y)w £ Q ,(y) dy 
x - y\ d+1 \w £ QI (y)\ dy 

(2)/ \ 



< rMl 1+1 ' d J 2 ) 

~ L |Q'| J 



□ 
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An Estimate for Riesz Transforms. For our use at the end of the proof of the lower 
bound on operator norms on Riesz commutators, we will need a more quantitative estimate 
on upper bounds of such commutators. This estimate is most convenient to state here. 

Proposition 5.16. For all integers a > 1, consider the operator 

V a (f,g)= £ E </><><^>[M <; ,R s H 

\Q'\=2"\Q\e',e£Sig d 

where R s is the sth Riesz transform on M. d . We have the estimate 
(5.17) ||U a || Para <2- Ma . 

This estimate holds for all a,M > 1, and all Riesz transforms, with implied constant only 
being a function of M, and the dimension d. 

The proof is a simple variant on the previous proof. Clearly the role of the signatures is 
unimportant, and we will ignore the sum on the signatures in the argument below. Note that 
U a is a paraproduct, with zeros falling on /, and zeros falling on g. Now, Q and Q' have 
different scales, which means that wq and wq> are not adapted to cubes of the same scale. 

This was exactly the problem addressed with the inequality (I5.14p above. However, the 
Riesz transform has an infinitely smooth kernel. Therefore, a stronger form of (I5.14p holds. 
Namely, for all m > 1, we have 

i[[m % „r s h(x)i < [^r^i+'-^r^Qr^i^r, \c?\=r\Q\. 

The implied constant depends only upon m, through the growth of the constants in the 
relevant estimates of the Riesz transform kernels. 

The proof of the estimate proceeds just as the proof of (15.141) . so we omit the details. The 
derivation of the proposition from this last estimate is routine. 



6. The Lower Bound 

We turn to the converse to Corollary 15.91 namely the Theorem below, which includes half 
of our Main Theorem. 

Theorem 6.1. We have the inequalities below, valid for all choices of d. 

INlBMO < IHlRicsz.p , ||fr||cone,p , 1 < p < OO . 

where the two norms are defined in Ii5.5\) and Ii5.8\) . For the inequality involving the cone 
norm, the implied constant depends upon the aperture of the cone. 
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Remark. It is enough to prove this inequality with the L 2 operator norm on the right hand 
side. If a commutator is bounded from L p to itself, then it is also bounded on the conjugate 
space LP , and so by interpolation bounded on L 2 . That is, we have the inequality ||6||iuesz,2 ^ 
II &|| Ricsz,p, valid for all 1 < p < oo. The same inequality holds for the Cone norm. 

For the rest of this paper, we will denote ||6||Riesz,2 by 1 1 1 1 mesz and similarly for the Cone 
norms. 

We use induction on parameters, namely the number of coordinates in d. The base case is 
t — 1. Coifman, Rochberg and Weiss [8] proved that ||6||bmo ^ ||fr|| Riesz- This is a well known 
result, with a concise proof. We find it necessary to prove the same inequality for the cones 
as an aid to proving the result about Riesz transforms. Indeed, it was this part of the proof 
that motivated the definition of the cone norm. 

In the case t — 1, we indeed have the inequality ||6||bmo ^ H&Hcone- This is a consequence of 
a deep line of investigation begun by Uchiyama [29] , in which both directions of the Coifman, 
Rochberg, and Weiss result were extended to more general Calderon-Zygmund operators. In 
particular, a result of Song-Ying Li gives us as a Corollary to his Theorem, this essential 
result, which completes our discussion of the base case t = 1 in our induction on parameters. 

Theorem 6.2 (Li [21]). In the case oft — 1, for all d > 1 and symbols b on M we have 

||&||BMO<||&||cone. 

In the inductive stage of the proof, we use the induction hypothesis to derive a lower bound 
on the commutator norms in terms of the BMO_i norm. In so doing, it is very useful to use 
the equivalent Weak Factorization Theorem. 

We then 'bootstrap' from this weaker inequality to the full inequality. Namely, we can 
work with a symbol b with BMO norm one, but with BMO_i norm smallH With b fixed, 
we select an appropriate commutator which will admit a lower bound on its operator norm. 
We select a test function which will show that the commutator has a large operator norm. 
Verification of this fact will depend critically on the assumption that the symbol has small 
BMO_i norm, and the Journe Lemma. 

6.1. The Initial BMO i Lower Bound. We assume that t > 2 and use the induction 
hypothesis to establish a lower bound on the Riesz and Cone norms of a symbol. This norm 
is in terms of our BMO norm with t — 1 parameters. 

Lemma 6.3. For t > 2, assume Theorem \6.1\ in the case oft — 1 parameters. Then we have 
the estimate 

(6.4) H&UbMO-x < IHIcone, 1 1 6 1 1 Ries z - 



'That is, the function b is of the type found in Carleson's examples 4J. 
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where it is to be emphasized that the BMO_i norm on the left is the BMO norm of t — 1 
parameters. 

Proof. We only give the proof of ||6||bmo_i ~ ll&llmesz explicitly. This proof uses an equiv- 
alent form of the induction hypothesis, namely a weak factorization result on H 1 in t — 1 
parameters. The same weak factorization result holds for Cone transforms. See Li [2T] for 
the one parameter formulation of this result. 

Using the notation of (11.31) . it is a straightforward exercise in duality to demonstrate that 

(6-5) BUp||Cj(6, 01138—2 ~ M^mQL^y 

1 

Therefore to show ( 16.41) . it is sufficient to demonstrate that the following inequality holds, 
(6-6) > II&IIhmo-x, 

and this will be established by relying upon the truth of the Theorem in t — 1 parameters. 

Given a smooth symbol b{x\, . . . x t ) = b(xi, x') of t parameters, we assume that ||6||bmo_i = 
1. As the symbol is smooth, the supremum in the norm is achieved by a collection of 
rectangles U of T>^oi t — 1 parameters. We can assume that the rectangles in U agree in the 
first coordinate, to a cube Q C M. dl . As there are free dilations in each coordinate, we can 
assume that \Q\ = 1 and |sh(W)| ~ 1. Then define 

ReU eeSigj 

One notes that (b, ip) = 1. To prove the claim, it is then enough to demonstrate that 
||^|| L 2( M d)g L 2( R d) < 1. Next observe that ip(x) = ipi(xx)ip' (x') and ip x G H 1 (R dl ) with 

IIV , i||fri(K d i) = 1- 

For the function ipi, we use the one parameter weak factorization of if 1 (IR dl ) of Coifman, 
Rochberg and Weiss [8]: There exists functions fi,g J n G L 2 {R dl ), n G N, 1 < ji < di, such 
that 

oo d\ 
n=l j\=\ 

where Ilij l (p,q) := Rij 1 (p)q + pRi,j 1 {q)- O ne next sees that if)' G -ff 1 (®; =2 K di ) with norm 
controlled by a constant. This follows from the choice of U and the square function char- 
acterization of the space if 1 ((S>|~jM di )- By the induction hypothesis in t — 1 parameters, in 
particular that ^((g^R*) = L 2 (®* =2 M^)0L 2 (®* =2 M ds ), we have G L 2 (®* =2 M ns ) 
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with m G N and j & vector with 1 < j s < d s for s = 2, . . . , t such that 

oo oo 
m=l j m=l J 

This immediately implies ( 16.41) since ^ = ipiip' , and we have a weak factorization of ip with 

7. The Bootstrapping Argument 

In this section, we assume that ||6||bmo_i < <^-i> i s ver Y small, for a constant < 5-i < 1 
to be chosen. Under this additional assumption, we conclude the proof of Theorem 16. 11 

This proof is intricate, and indeed at this stage we find it essential to first prove the result 
for cones, namely we first prove ||6||bmo H&Hcone- Elements of this proof in this case are 
essential to address the Riesz norm case. 



7.1. The Lower Bound on the Cone Norm. This case follows the lines of the argument 
of Lacey and Terwilleger [2D]. (The current argument is however somewhat simpler.) We 
make a remark about the cone norms with different apertures. Given a cone C with data 
(£) Q) where £ is a unit vector in M. d and Q is a cube, consider a second cone C with data 
(£',C"). We can map one cone into the other with an orthogonal rotation and a dilation in 
d — 1 variables. Thus, the corresponding Calderon-Zygmund kernels Kc and Kc> can be 
mapped one into the other by way of these same transformations. 

A rotation preserves the Calderon-Zygmund norm of the kernel, but the dilation does not 
since it is not uniform in all coordinates. Nevertheless, this observation shows that the Cone 
norms associated to distinct apertures are comparable. Thus, to prove our result, it suffices 
to demonstrate the existence of some aperture for which the Theorem is true. This we will 
do by taking a somewhat large aperture, that approximates a half space. 

For a choice of symbol b with ||6||bmo — 1 an d ||&||bmo_i < ^-i> there is an associated open 
set U for which we achieve the supremum in the BMO norm. After an appropriate dilation, 
we can assume | < |sh(ZY)| < 1. Let U = {R 6 X> d - : R C U}. 

For a collection of rectangles T define the wavelet projection onto T as 

iter eesig d - 

Define (3 = Pu b. We use this function to build a test function to demonstrate a lower bound 
on the Cone norm. 
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The purpose of the next steps is to select the cones we will use. This issue involves some 
subtleties motivated by subsequent steps in the proof. Of particular importance is that the 
selection of the cones be dependent only on dimension, as well as satisfy some particular 
estimates. It turns out to be a useful device to select two distinct cones: One will be used 
for the selection of the test function for the commutator, and the other for the cones we use 
to define the commutator. 



Given a cone C with data (£, Q), let He be the convolution operator with symbol l(o,oo)(£ • 
6), which is to say that He is the Fourier projection onto a half space associated with C. 

Lemma 7.1. Given d and k > we can select cones 

D s c C s c R ds , 1 < s < t . 

These cones have data (£ a , Q s ) and (£ s , Q' s ) respectively. They are, up to a rotation, only a 
function of d and k > 0, and they satisfy these properties. Defining 

(7.2) 7 = T Dl ---T Dt P 

we have 

(7-3) Il7l| 2 >4-*. 
(7.4) ||H Dl ---H 0t /?-7|| 4 <K. 
(7-5) ||(H Cl ...H Ct -P Cl ...P Ct )|7| 2 || 2 <«:. 

Notice that ( |7.^| ) estimates an L 4 norm; and that ( 7.5\ ) concerns the function |7| 2 , and we 
are estimating the difference between the projections onto the half spaces defined by the cones, 
and the projection onto the larger cones. 



Proof. We begin with the selection of the cones D SJ which is a randomized procedure. Fix a 
small constant < rj < j^. In each dimension M. ds , fix an aperture Q s so that the cone D s 
with this aperture satisfies 

p^n^- 1 1 s d °- 1 ) >\-rj. 

Here, denotes the sphere in M. ds endowed with the canonical normalized surface measure. 
The notation above is the standard way to denote conditional probability. 

Now, let D' s denote a random rotation of the cone D s . Taking expectations of L 2 norms 
below, we have access to the Plancherel identity to see that 

e||p d; • - -p D , = Cc[ e [ \p(0\ 2 d£> (| - v y . 

J Di®-®£>{ 

But also, we must have 

E||(H Dr ..H^-P or .-P D ,)/3||^<^. 
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View these statements about the L 2 norm of non-negative random variables. As concerns 
the first inequality, note that 

sup ||P Dr --P^/3||^<l, 

D[,...,D' t 

Hence, we see that 

n\\PD> 1 ---p D >m>m>(i-?i)\ 

P(|| (Ebi • ■ • Hflj - P D >- ■ ■ P D > t )P\\l > V t/2 ) < V t/2 ■ 
Therefore, for rj sufficiently small, we can select cones D[, . . . , D' t so that 

\\v D[ ---PD'j\\i>(\y 
\m D[ ---^ D[ -v D[ ---v D > t mi<r, t/2 . 

On the other hand, we automatically have 

|| (H^ ■■■E Dl -T D[ --- T D ' t )Ph < C\\P\\ B < K 

for an absolute constant K. Keeping in mind the fact that the symbol of a cone operator Tc 
is identically one on the cone C, we see that we have proved i \7.3h and (17.41) . 

We use proof by contradiction to find the cones C s . Fix the cones D s as above, and let us 
suppose that (17. 5p fails for some k > 0. Then, we can find a sequence of cones 

C k s C C k s +1 , 1 < s < t , k > 1 

with data Q' s k ), where the apertures Q' s k increase to all of IR ds_1 . 

We can also find functions (3k satisfying 

11/3*112 = 1, \\[3kh<K 8 , 
\\(R cr --R c ,-P cr --P c ,)\ lk \%>K. 

where 7^ is defined as in (17. 2p . The constant K s depends only on d, and the John Nirenberg 
inequality for BMO. 

In particular, as we can assume an upper bound on the L s norm of the (3k, the sequence 
of functions {(3k} are precompact in the L? topology. Letting (3^ be a limit point of the 
sequence of functions, and defining 7^ as in (I7.2p . we see that for all large k, 

||(H cr ..H cj ,-P cr ..P Ctfc )|7oori| 2 >«. 

But this is an absurdity, as in the limit, the symbol of this difference is supported on a 
subspace of codimension t. Therefore, (17.51) holds. 

□ 



MULTIPARAMETER RIESZ COMMUTATORS 



27 



The cones we form the commutator of are the C s of the previous Lemma. We will test the 
commutator against the function 7, where 7 is as in ( 17. 2(1 . 



By Journe's Lemma, in particular Lemma [3781 there will exist an open set V which satisfies 
the conditions of that Lemma. Set 

(7.6) V d = {R : R C V, R <£ sh(U)}. 

Finally, let W d = V s -U - V. 

The function 7 enjoys these properties of Lemma [771] as well as the ones below, which will 
conclude the proof. 



(7.7) 


ll[T Cl) - 


■■[T Ct ,M Pub ]- 


••]tI 


I9 > 1 

1 2 ~ x ) 


(7.8) 


ll[Tc 1; - 


••[T Gt ,Mp v6 ]- 


••]7l 


I2 ^ , 


(7.9) 


!I[t Ci ,-- 




■■ItI 


| 2 < Kj5-! 



Here < 5j < 1 is the constant associated with Journe's Lemma (called r\ in Lemma 13.81) . 
that is to be specified. Kj is a function of 5j. These estimates will lead to an absolute lower 
bound and prove Theorem 16.11 Namely, the implied constants in each of the inequalities 
depend only on d, while 5j and 5_i are free to choose. Certainly we can choose Sj first, and 
then with Kj specified in ( 17. 9p . select <5_i to prove our Theorem. 

Estimate (17.81) is straightforward. It is easy to see that 

|| [T Cl , ■ ■ ■ [T Ct ,M Pvb ] ■ ■ .}j\\ 2 < ||P V 6|| 4 ||t||4 < ||Pv b\U , 

where the implied constant depends upon the the L 4 norms of the Cone transforms. But, by 
Journe's Lemma [3.81 and construction, we have that 

||Pv&|| 2 <5y 2 , ||P v 6||bmo<1, 

which implies that 

iiPv&ii 4 <5y 4 . 

These together give (17.81) . 

We turn to the verification of (17. 7p and provide a lower bound for the L 2 norm below 

||[T Cl ,.-.[T Ct ,M^]---]7|| 2 . 

Recall that (3 = Pub and the definition of 7 in ( \7.2\\ . The commutator is a linear combination 
of terms T[/3 ■ T'7] , where T and T' are either the identity, or a product in Tc s , 1 < s < t. 
(Each Tc s must occur in either T or T'.) In each case that T' is a non-trivial product, we 
have T'7 = 0. It then remains to consider the only term not of this type, namely 

T Cl ..-T Ct [/3-7]. 
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Write (3 = 7 + (3 1 + (3", where the smaller cones from Lemma [7.11 enter again below. 

(3' = (B. Cl ■ ■ ■ Hc t - T Dl • • • T Dt )f3 , 

/r = (i-H Cl ---H Ct )/?. 



(7.10) 



Note that (17.31) provides information about (3' . We need to consider • • -Tctlft ' 7L which 
is now divided into three terms. They are 

T Cl • • • T Ct [(3 ■ 7] = T Cl ■ ■ ■ T Ct [(3" ■ 7] + T Cl ■ • -T Ct [/?' • 7] 

+ T Cl • ■ ■ T Ct [7 • 7] . 



Now, 7 and are supported on the same product of halfspaces, which are complementary 
to the cones, thus 



(7.11) 



- Ci 



•T Ct [/3".7]=0. 



For /?' we do not attempt to find any cancellation, just relying on the favorable estimate from 

(El. 



(7.12) l|Tc 1 ---T Ct [/3'-7]||2<||/3 / ||4-||7ll4<K. 
The last term holds the essence of this component of the argument. By (17.51) . 
||T Cl • • -T Ct [7 ■ 7]|| 2 - « > ||H Cl ■ ■ -H Ct [7 • 7]|| 2 



^ 117 ■ T II 2 



(7.13) 



£\|2 il/2 2 
R 



> 1. 

The second line follows as the Fourier transform of 7 ■ 7 is symmetric with respect to the half 
planes determined by the cones; the third line is obvious; the fourth line uses the Littlewood- 
Paley inequalities; and the fifth line uses the fact that the rectangles in U are in a set of 
measure at most one. This completes the proof of (17.71) . 



The remainder of the paper is devoted to proving (17. 9p which is taken up in the next 
section. 



The Proof of ( 17. 9j) . Theorem 14.71 will let us conclude estimate (17.91) . and the proof of 
Theorem 16.11 
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We have observed that the commutator we are considering simplifies considerably when 
applied to the function 7. Letting T = • • • Tc t i an d T' = T Dl ■ ■ ■ T Dt the estimate we are 
to prove is 

||T(P W b ■ 7)|| 2 = ||T(P W b ■ TP) || a < KjS-! . 

It is critical to observe that the outermost T imposes a cancellation condition similar to the 
one defining paraproducts. For R ElA and R' G W, and choices of signatures e, £ G Sig^*, we 
have 

T{w% ■ T' w e R ) =0 if for any 1 < s < t, \Q' S \ > 64|Q S |. 

Recall that we defined 7 = T'/3. T and T' are a convolution operators, so the Fourier 
support of Tw £ R is contained in the Fourier support of w e R . Therefore, this property follows 
immediately from the properties of the Meyer wavelet. 

Using this observation, we see the estimate to be proved is 

" <*-i, 



2 



(7.14) || Y, E (6^|)(&,^)T( W |,-T' W | 

e.eeSig^ (R,R')eA 

A = {(R, R') : R C U , V , |Q'J < 64|Q S | , 1 < s < t} . 

Notice that in the L 2 norm, we are free to remove the operator T, as it is a bounded operator 
on L 2 (IR d ~) . 

It is essential to observe that this last sum can be written as a sum of paraproducts, as 
in Theorem 14.71 The purpose of these next definitions is to decompose the collection A into 
appropriate parts to which Theorem 14.71 applies. For an integer n > 1, take 

U n = {ReU : 2™- 1 < Emb(R) < 2 n } , 

A n = {{R,R') : ReU n , R! £V , \Q' S \ < 64|Q S | , 1 < s < t} . 

Here, Emb is the function supplied to us by Journe's Lemma, Lemma 13.81 Hence, as a 
consequence we have 

||P^&||bmo < 2^^!, 

where C is a large constant depending only on d. Observe that for (R, R') e A n we necessarily 
have 2 n ~ 1 R D R' = 0. In particular, the assumption (14. 10p will hold with A ~ 2 n . From this 
separation, and the rapid decay of the Meyer wavelet, we will gain an arbitrarily large power 
of 2~ n . Thus the presence of the term 2 Cn in this last estimate turns out not to be a concern 
for us. 

Our estimate below is a consequence of (14.111) . after a further decomposition of the sum to 
account for the role of the location of the zeros, controlled by the set J in (14. lip . 



(7.15) || J2 E &«$<m£>(«&-t'«&) 



< 2-^_! , n > 1 . 
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Summation in n > 1 will then prove (I7.14p . 

Specifically, let Jc {1, . . . , t}, and let integer fceZ' satisfy 

k s = 8, s£ J, -8 < k s < 8 , s e J . 
Let A j g be a subset of A n given by 

A,j,k = i( R ,R') e A : |Q'J < 2~ 8 |Q S | , s^J, \Q' a \ = 2 k °\Q s \ , s G J} 
For this collection, the estimate 

e^esi (V (fl I Ji')e^ iJiS 



2 



is then a consequence of (14.111) . This estimate is summed over J C {1, . . . , n} and to prove 
(H35D- 

7.2. The Lower Bound on the Riesz Transforms. 

Properties of Riesz Transforms. We need some special properties of Riesz transforms. Vari- 
ants are in the paper of Coifman, Rochberg and Weiss [8]. 

Proposition 7.16. For each s, let T s be a polynomial in the Riesz transforms on IR ds . Then, 
we have the inequality 

||[...[M 6 ,Ti],...],T t ]|| a ^ 2 <Bup||C 3 <6,0llM- 

3 

The implied constant depends upon the choice of polynomials T s . 

This in fact follows from the elementary identity 

[M 6) Rj R fc ] = [M 6 , Rj] R k + Rj[M b , R k ] . 

An operator T which is a polynomial in Riesz transforms is a convolution operator, with 
radial symbol. Below, we will only describe the symbols that we are interested in. 

The selection of the operators which are polynomials in Riesz transforms is hardly obvious, 
and we identify their properties in the following Lemma. 

Lemma 7.17. Given any < rj < 1 and any cone C in M. d , there is an operator XJc, a linear 
combination of the identity and a polynomial in Riesz transforms on M> d , with symbols Vq 
such that 



(7.18) 



\v c (0-M<V ZtC 
\v c (0\<v 
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Finally, we have the estimate 

(7.19) l|Udl P <C p , Kp<oo. 

The constant C p is independent of the choice of the cone C and dimension d. 

Proof. This depends upon particular properties of spherical harmonics and zonal polynomials. 
We were aided by |2j in our search for this proof. 

It suffices to prove the following. For a choice of cone C in IR d , with direction £<7, and 
< 7] < 1, we can choose operator U with symbol v which is odd with respect to £c, 
llf lino < 2, and 



^7.20) 



\v(C) + l\<v t£-C 



Finally, v restricted to the unit sphere is a polynomial in ^ for (^1,^2, • • • , £<z) £ S . We 
will see that the degree is at most < r]~ l log 1/rj. 

Then, U is in fact a polynomial in Riesz transforms. Since v is odd, the method of rotations 
applies to provide us with an estimate of ||U|| P < C p , where C p is absolute for 1 < p < 00. 
To get an operator with symbol as in our Lemma, we add the identity operator to U. 

We obtain the symbol v by employing the Poisson kernel in the ball in M d , and as well 
an expansion of this kernel into zonal harmonics. Let us recall the properties we need. The 
Poisson kernel in R d is 

^>0 = ^p, M<i,KI = i. 

A homogeneous harmonic polynomial p of degree m on M d has the reproducing formula 

m p- 97] 



p(x) = \x\ m / p(C)Z m (x,()a(dO. 
Js d ^ 

Here, a denotes normalized Lebesgue measure on the unit sphere S d ~ x in IR d . The polynomial 
Z m (x, () is a zonal polynomial of degree m. It follows that the Poisson kernel admits an 
expansion in terms of these polynomials 

00 

m=0 

This series is absolutely convergent sum in the interior of the unit ball, thanks to the ele- 
mentary estimate 

(7.21) \Z m (xX)\<m d -M m - 
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It is a basic property of the zonal polynomials that they are only a function of \x\ and x ■ (. 
Indeed, they can be expanded as 



[m/2] 



m-2k\ \2k 



k=0 



Here, Cfc >m are known real coefficients. In particular, for a function v on the unit sphere that 
is odd, the new function 



v(C)z m (x,o MO 



is also odd, for \x\ held fixed. 



To construct our operator U, via its symbol v, recall that the operator U is associated to a 
cone C with direction £c- On the cone C and the opposite cone — C we require rather precise 
information about the symbol v. Outside of these cones we only require an absolute upper 
bound on v. Hence, we have some freedom in taking an initial approximate to the symbol v. 
In what follows, we concentrate on defining the symbol on the sphere S^ -1 . 

Take as an initial approximate v(£) = sign(£ • For a small constant c, consider the 
function 

v(0) £(C)P((1- 000,0 MO ■ 

This function will be non- negative, odd, bounded in absolute value by 1, and satisfy (17.201) . 
It is not however a polynomial in spherical harmonics. 



But each function 



vm(0) = I v{QZ m {{\- cq)6,0M0 



is also odd, as we have already noted. By (I7.2ip . we have the estimate 



oo 



v m <^/4, m = C(\ogl/r])/r) 



m=mo 



where C depends upon the dimension d. Therefore, the function v we need can be taken to 
be 

mo 
m=0 

Our proof is complete. 

□ 
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The Selection of a Test Function. We continue to assume that the symbol b satisfies ||6||bmo 
1 while ||6||bmo_i < We follow many of the initial stages of the proof of the lower bound 
on the Cone norm. We choose cones D s C C s , and cone operators T£) s ,Tc s , for 1 < s < t 
just as in Lemma 17. II 

We continue to use the notations U, V and W, thus |sh(W)| ~ 1 and 

^|(6,^)| 2 ^|sh(W)|. 

Reu 

V and W are defined as in (I7.6p . and P = Pub. As before, we set 7 = T^ • ■ • T Dt (3. 

For < i] < 1 to be chosen, apply Lemma 17.171 obtaining operators T s which are a linear 
combinations of the identity and polynomials in Riesz transforms on IR ds which approximate 
the projection operator Pc s in the sense of that Lemma. Let us see that we have the estimate 

(7.22) ||[T 1 ,---[T t ,M /3 ]---]7|| 2 >l. 
The commutator is a linear combination of 2* terms of the form 

T[/3T'7] 

where T and T' are either the identity or a composition of the operators T s . If T' is not the 
identity, it follows that the symbol of T' is at most rj on the Fourier support of 7. Therefore, 
we can estimate 

l|T[/3T'7]|| 2 < ||/?T'7||2< ||/5||4||T'7||4<V /3 - 

This last estimate follows from || T' 'y || 2 ^ Vj while ||T'7|| 8 < 1. This point is imperative, and 
follows from the uniform LP bounds we obtain from Lemma 17.171 

This leaves the term T x • • 'T^ffi. But, for a sufficiently small choice of 77, we are free to 
use the same argument as in (17.131) . This proves (17.221) . 

It then follows from Proposition 17.161 that for some choice of Riesz transforms R Js on IR ds 
we have 

I|C(/3,7)I| 2 >1, 

(7.23) , , 

where C(f,g) = • • • [R it , M f ] ■ ■ -]g . 

Now, it also follows that 

(7-24) l|C(P v &,7)l| 2 <5y 4 . 

Indeed, we can appeal to the same argument as used to prove (17. 8p . 

Finally, we claim that 
(7.25) ||C(P w fc,7)ll2 <<*_!. 
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This estimate requires the same argument as for (17.91) . plus an additional estimate; the details 
are below. The three inequalities (17.231) . (17.241) and (I7.25P are then combined in in the same 
manner as in the proof of the lower bound on the Cone norm to complete the proof. 



Proof of \7.25 ). The different quantitative estimates we have for paraproducts are brought 



to bear on this estimate. First, we expand the expression C(P>v&, 7) into the sum of com- 
mutators on different pairs of wavelets. This sum is further written as D\ + D2, where we 
define D\ explicitly here. 

def 



D i = E E (7,0(6,4)^.^ 

(7.26) e,eeSi g<i (R,R')eA 
A={(R,R'):ReU,R'(£V, \Q' S \ < 6A\Q S \ ,l<s<t}. 

This is the part of the commutator that most closely resembles the part arising from the 
commutator arising from the Cone operators. 

It is essential to observe that this last sum can be written as a finite sum of compositions of 
Riesz transforms and the "paraproducts" in Theorem 14 .7\ and in particular the more technical 
estimate (j4.11j) . applied to the functions b and Pyyb. We also comment that the Riesz 
transforms applied to the wavelet element w £ R do not substantially change the localization 
properties of the wavelet, and thus the Riesz transforms do not spoil the estimates that appear 
in Theorem 14.71 This sum varies of choices of k with || &||oo < 8, an d arbitrary Jc {1,. ..,£}. 
(The subset J consists of those coordinates s for which \Q S \ = 2 ks \Q' s \.) 

We will need to decompose the collection A into appropriate parts to which this estimate 
applies. That is the purpose of this definition. For an integer n > 1, take 

In = E E (J, w r) w R 

eeSig,- Rcu 

2™- 1 <Emb(fi)<2 n 

We claim that 

(7.27) ||C(P W &,7^)|| 2 <2-^_ 1 . 
It follows from Lemma [3.81 that we have the estimate 
( 7 - 28 ) ll7n|lBMO(M<?) ~ 2 2 ' n 5_i , 

indeed, this is the point of this definition. From other parts of the expansion of the Riesz 
commutator we need to find some decay in n. 

Nevertheless, from this estimate and the upper bound on Riesz commutator norms, we 
have the estimate 

||C(P W 6,%)||2< ll&llBMO^)H7n||2<2 2t ^ 1 . 

We use this estimate for n < 20, say. 
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For n > 20, R E U with < Emb( J R) < 2 n , and rectangle R! with (R, R') E A, it 
follows that we must have 2 n ~ 9 R fl i?' = 0. That is, (14.101) is satisfied with the value of A in 
that display being A ~ 2 n for n > 20. Thus, we conclude that 

\\C(P w b,%)h< 2- 5 n_ 1; n>20. 
This completes our proof of (17.271) . and the proof estimate (17. 9ft . 

It remains to estimate the term D 2 . The principal tool here is the estimate for Riesz com- 
mutators given in Lemma 15.161 and in order to apply this lemma, as well as take advantage 
of our remaining freedom to select the 5_i norm, we need a sophisticated decomposition of 
the sum that controls D 2 . That is the point of these next definitions. 

Let m be an integer. For a non empty subset J C {l,...,t}, and choices of integers 
a = (aj)j e j with aj > 7, we define 

D(m,J,a)= (7>™fj) (M&> C(w|,,w|) 

e.e'GSig^ (R,Ri)£A(m,J,a) 

(7 29) 

A(m, J, a) = {(R, R!) : R E U , R' <£ V ; \Q' B \ < 64|Q S | , s J ; 

\Q' S \ = 2 a \Q s \ , s E J; 2 m < Emb(i?) < 2 m+1 } . 
With this definitions, we will have 

D 2 = D(m, J, a) . 

m,J,a 

The estimate below holds. 

(7.30) \\D(m,J,a)\\ 2 <2- m -^ a °5_ 1 . 

This is summable over the parameters m, J, a, and so completes the proof of the estimate for 

Recall that we have the essential consequence of Journe's Lemma. 
(7-31) I J2 E^^)^ RMn ^ 2C ^- 

Reu eeSig,- 
2 ra <Emb(_R)<2 m+1 



BMO 



The first subcase occurs when we have 
(7.32) maxa s <^. 

sG J 

It follows from the definition of embeddedness that Q s / n 2 m / 4 Q s = for all (R, R') E 
A(m, J, a). Therefore, the function 



(7.33) 2' Nm ^\Q^\C(v l ,. i r 
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is adapted to Q s , where N > 1 can be taken arbitrarily. This sum can be understood as a 
paraproduct, with symbol given by Pyy&, applied to 7. Zeros fall on 7 for those coordinates 
s E J. Using ( 17.3 ip and the estimate ( 17. 32ft . we see that (I7.30P holds in this case. 

The second case is when (17.321) fails in any coordinate, say sq G J. In this instance, 
we see that the sum we are considering in that coordinate is of the type considered in 
Lemma [5.161 That is, the sum is an operator whose paraproduct norm, as defined in (I4.16P 
is at most 2~ Naa o, for arbitrarily large N. In all other coordinates, the sum is an operator 
with paraproduct norm at most a constant. The tensor product of paraproducts is a bounded 
operator, therefore in this case, we have 

\\D(m, J, d)\\ 2 < 2- N ^2 c ^ 1 < 2- 2 ^5_ 1 . 

Here of course, we again rely upon ( I7.3ip . This completes the proof of ( I7.30p . This in turn 
completes the proof of the lower bound on the norm of multiparameter Riesz commutators. 
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